Comparing Exact and Approximate Spatial Auto-regression Model Solutions for Spatial Data Analysis

نویسندگان

  • Baris M. Kazar
  • Shashi Shekhar
  • David J. Lilja
  • Ranga Raju Vatsavai
  • R. Kelley Pace
چکیده

The spatial auto-regression (SAR) model is a popular spatial data analysis technique, which has been used in many applications with geo-spatial datasets. However, exact solutions for estimating SAR parameters are computationally expensive due to the need to compute all the eigenvalues of a very large matrix. Recently we developed a dense-exact parallel formulation of the SAR parameter estimation procedure using data parallelism and a hybrid programming technique. Though this parallel implementation showed scalability up to eight processors, the exact solution still suffers from high computational complexity and memory requirements. These limitations have led us to investigate approximate solutions for SAR model parameter estimation with the main objective of scaling the SAR model for large spatial data analysis problems. In this paper we present two candidate approximate-semi-sparse solutions of the SAR model based on Taylor series expansion and Chebyshev polynomials. Our initial experiments showed that these new techniques scale well for very large data sets, such as remote sensing images having millions of pixels. The results also show that the differences between exact and approximate SAR parameter estimates are within 0.7% and 8.2% for Chebyshev polynomials and Taylor series expansion, respectively, and have no significant effect on the prediction accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Analysis of Survival Data with Spatial Correlation

Often in practice the data on the mortality of a living unit correlation is due to the location of the observations in the study‎. ‎One of the most important issues in the analysis of survival data with spatial dependence‎, ‎is estimation of the parameters and prediction of the unknown values in known sites based on observations vector‎. ‎In this paper to analyze this type of survival‎, ‎Cox...

متن کامل

A Comparison of Thin Plate and Spherical Splines with Multiple Regression

Thin plate and spherical splines are nonparametric methods suitable for spatial data analysis. Thin plate splines acquire efficient practical and high precision solutions in spatial interpolations. Two components in the model fitting is considered: spatial deviations of data and the model roughness. On the other hand, in parametric regression, the relationship between explanatory and response v...

متن کامل

Spatial Dependency Modeling Using Spatial Auto-Regression

Parameter estimation of the spatial auto-regression model (SAR) is important because we can model the spatial dependency, i.e., spatial autocorrelation present in the geo-spatial data. SAR is a popular data mining technique used in many geo-spatial application domains such as regional economics, ecology, environmental management, public safety, public health, transportation, and business. Howev...

متن کامل

Investigation and analysis of cycles and spatial correlation model of Iranian monthly rainfalls

The purpose of this study is to analyze and analyze Iran's precipitation over the past half-century(1967-2017). For this purpose, the average monthly rainfall of Iran during the statistical period of 50 years was extracted from Esfazari databases (Which is provided using data from 283 stations of Synoptic and Climatology). Regression analysis was used to analyze the trend and to analyze the ann...

متن کامل

Spatial Varying Coefficient Regression Model For Relative Risk Factors of Esophageal Cancer Patients

In conventional methods for spatial survival data modeling, it is often assumed that the coefficients of explanatory variables in different regions have a constant effect on survival time. Usually, the spatial correlation of data through a random effect is also included in the model. But in many practical issues, the factors affecting survival time do not have the same effects in different regi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004